(57) A voice recognition system for recognizing an
input voice of a narrow frequency band. The voice recognition
system includes: a frequency band converting
unit for converting the input voice of the narrow frequency
band into a pseudo voice of a wide frequency band
which covers an entirety of the narrow frequency band
and which is wider than the narrow frequency band.
Description

[0001] The present invention relates to a voice recognition
system and a standard pattern preparation system
for preparing standard patterns to be used for the
voice recognition process by the voice recognition system,
as well as a method of preparing the standard patterns
and a computer program for preparing the standard
patterns, and more particularly to a voice recognition
system for recognizing a voice of a narrow frequency band
such as a telephone voice.
[0002] FIG. 1 is a block diagram illustrative of the conventional
voice recognition system. The conventional
voice recognition system includes a characteristic extraction
unit 100 and a pattern reference unit 103. The
characteristic extraction unit 100 receives an input of a
voice 105 and converts the voice into a characteristic
vector time series. The pattern reference unit 103 receives
the characteristic vector time series and compares
the characteristic vector time series with a standard
pattern 104 for the voice recognition before the pattern
reference unit 103 outputs a voice recognition result
106. This conventional voice recognition system is described
in "the fundamentals of the voice
recognition", NTT Advanced Technology, 1995.
[0003] A mercepstrum (mel-cepstrum) characteristic extraction may
be available for the characteristic extraction unit 100.
The characteristic extraction unit 100 further includes a
power spectrum calculation unit 101 for calculating a
short-term power spectrum of the input voice 105
and a mercepstrum calculation unit 102 receiving the
power spectrum from the power spectrum calculation
unit 101 and performing a mer (mel-scale) conversion and a cosine
conversion of a logarithm of the power spectrum, thereby
extracting a mercepstrum characteristic quantity.
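The extraction chain described above (short-term power spectrum, mel-scale conversion, logarithm, cosine conversion) can be sketched as follows. This is a minimal illustration only: the triangular filter bank, the Hamming window, the 2595·log10(1 + f/700) mel formula, and all function names and parameter values are assumptions chosen for the example and are not specified in this description.

```python
import numpy as np

def hz_to_mel(f):
    # Common mel-scale formula (assumed; the description does not fix one).
    return 2595.0 * np.log10(1.0 + np.asarray(f, dtype=float) / 700.0)

def mel_to_hz(m):
    return 700.0 * (10.0 ** (np.asarray(m, dtype=float) / 2595.0) - 1.0)

def mel_filterbank(n_filters, n_fft, sample_rate):
    """Triangular filters spaced evenly on the mel scale (assumed design)."""
    n_bins = n_fft // 2 + 1
    freqs = np.linspace(0.0, sample_rate / 2.0, n_bins)
    edges = mel_to_hz(np.linspace(0.0, hz_to_mel(sample_rate / 2.0), n_filters + 2))
    bank = np.zeros((n_filters, n_bins))
    for j in range(n_filters):
        lo, mid, hi = edges[j], edges[j + 1], edges[j + 2]
        rising = (freqs - lo) / (mid - lo)
        falling = (hi - freqs) / (hi - mid)
        bank[j] = np.clip(np.minimum(rising, falling), 0.0, None)
    return bank

def mel_cepstrum(frame, sample_rate, n_filters=24, n_ceps=12):
    """Hypothetical helper: one frame of samples -> mel-cepstrum vector."""
    n_fft = len(frame)
    # Short-term power spectrum of the windowed frame.
    power = np.abs(np.fft.rfft(frame * np.hamming(n_fft))) ** 2
    # Mel-scale conversion followed by the logarithm.
    mel_energy = mel_filterbank(n_filters, n_fft, sample_rate) @ power
    log_energy = np.log(mel_energy + 1e-10)  # small floor avoids log(0)
    # Cosine conversion (DCT-II) of the log mel energies.
    n = np.arange(n_filters)
    basis = np.cos(np.pi * np.outer(np.arange(n_ceps), n + 0.5) / n_filters)
    return basis @ log_energy
```

Calling `mel_cepstrum(frame, 8000)` on a 256-sample frame returns a 12-element vector; a sequence of such vectors over successive frames corresponds to the characteristic vector time series mentioned earlier.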
[0004] FIG. 2 is a block diagram illustrative of a conventional
standard pattern preparation system. The
conventional standard pattern preparation system prepares
the above described standard pattern 104 to be
referred to by the above described conventional voice recognition
system shown in FIG. 1. The conventional
standard pattern preparation system includes a charac- 
teristic extraction unit 200 and a standard pattern prep- 
aration unit 204. The characteristic extraction unit 200 
further includes a power spectrum calculation unit 201 
for calculating a power spectrum in a short term of a 
learning voice signal from a learning voice storing unit 
203, and a mercepstrum calculation unit 202 receiving 
the power spectrum from the power spectrum calcula- 
tion unit 201 and performing a mer conversion and a 
cosine conversion of a logarithm of the power spectrum, 
thereby extracting a mercepstrum characteristic quan- 
tity. 

[0005] The standard pattern preparation unit 204 re- 
ceives the mercepstrum characteristic quantity from the 
mercepstrum calculation unit 202 and prepares a stand- 
ard pattern. The standard pattern is stored in a standard 
pattern storing unit 205. 



[0006] With reference again to FIG. 1 the process for 
recognition of the narrow band frequency voice such as 
the telephone voice by the conventional voice recogni- 
tion system will be described. 

[0007] The telephone voice has a narrow frequency
band and is susceptible to substantial influence from
noise, for which reason it is generally difficult to recognize
the voice. The frequency band of the telephone
voice ranges from 300 Hz to 3400 Hz. A first formant
of the vowel, or the primary characteristic frequency region,
is important for the voice recognition. This first
formant of the vowel, or the primary characteristic frequency
region, may exist under 300 Hz, depending on the
speaker. In this case, the voice signal entered from the
telephone terminal may lack the first formant of the
vowel, or the primary characteristic frequency region, under
300 Hz.

[0008] A frequency range of the friction noise may often
be over 3000 Hz. In this case, the voice signal entered
from the telephone terminal may lack the friction
noise.

[0009] The restriction on the frequency band causes
the recognition of the telephone voice with the narrow
frequency band to be lower in accuracy than the recognition
of the microphone voice with a wide frequency
band.

[0010] JP-A-2000-250577 discloses the following
conventional technique for improving the frequency
characteristic of the voice with the narrow frequency
band entered from the microphone. This conventional
technique prevents any lack of the voice information and
also improves the voice recognition characteristic in the
presence of noise. A characteristic vector is selected
by a first code book from a voice input pattern as
received by a first voice receiver. A correction vector
is selected from a second code book in correspondence
with the index of the selected vector. The characteristic
vector and the correction vector are then added
to estimate the characteristic vector of the voice as it would be
received by a second receiver, which ensures a higher
voice-receiving sensitivity in a wide frequency band than
the first receiver.

[0011] In the above circumstances, the development 
of a novel voice recognition system is desirable. 
[0012] Accordingly, it is an object of the present invention
to provide a novel recognition system free from the
above problems.

[0013] It is a further object of the present invention to
provide a novel recognition system exhibiting a
performance of voice recognition in a narrow frequency
band that is close to the performance in a wide
frequency band.

[0014] It is a still further object of the present invention
to provide a novel standard pattern preparation system
for preparing standard patterns to be used for the voice
recognition process by the voice recognition system free
from the above problems.

[0015] It is yet a further object of the present invention
to provide a novel standard pattern preparation system
for preparing standard patterns to be used for the voice
recognition process by the voice recognition system exhibiting
a performance of voice recognition in
a narrow frequency band that is close to the performance
in a wide frequency band.

[0016] It is yet a further object of the present invention 
to provide a method of preparing the standard patterns 
free from the above problems. 
[0017] It is yet a further object of the present invention
to provide a method of preparing the standard patterns
exhibiting a performance of voice recognition
in a narrow frequency band that is close to the performance
in a wide frequency band.
[0018] It is yet a further object of the present invention 
to provide a computer program for preparing the stand- 
ard patterns free from the above problems. 
[0019] It is yet a further object of the present invention
to provide a computer program for preparing the standard
patterns exhibiting a performance of voice
recognition in a narrow frequency band that is close to the
performance in a wide frequency band.
[0020] The present invention provides a voice recognition
system for recognizing an input voice of a narrow
frequency band. The voice recognition system
includes: a frequency band converting unit for converting
the input voice of the narrow frequency band into a
pseudo voice of a wide frequency band which covers an
entirety of the narrow frequency band and which is wider
than the narrow frequency band.
[0021] A first aspect of the present invention is a voice
recognition system for recognizing an input voice of a
narrow frequency band. The voice recognition system
includes: a frequency band converting unit for converting
the input voice of the narrow frequency band into a
pseudo voice of a wide frequency band which covers an
entirety of the narrow frequency band and which is wider
than the narrow frequency band.
[0022] It is preferable that the voice recognition system
further includes: a power spectrum calculating unit
for calculating power spectrums of the input voice of the
narrow frequency band, and that the frequency band
converting unit further comprises: a specific vector storing
unit for storing specific vectors of power spectrums
of a wide frequency band voice; an expansion coefficient
calculating unit for calculating expansion coefficients
with which the power spectrums calculated by the power
spectrum calculating unit are expanded by the specific
vectors; and a frequency band expansion unit for calculating
additional power spectrums in a lack frequency
band by use of the expansion coefficients calculated by
the expansion coefficient calculating unit, where the lack
frequency band is covered by the wide frequency band
but not covered by the narrow frequency band, the
frequency band expansion unit combining the additional
power spectrums in the lack frequency band into the
power spectrum of the narrow frequency band calculated
by the power spectrum calculating unit to prepare
pseudo power spectrums of the pseudo voice of the
wide frequency band.

[0023] It is also preferable to further include: a pattern
reference unit for receiving an output from the frequency
band converting unit and comparing the output with a
standard pattern for carrying out the voice recognition.
[0024] It is also preferable to further include : a stand- 
ard pattern storing unit for storing the standard patterns 
for allowing the pattern reference unit to compare the 
output with the standard pattern. 
[0025] It is also preferable to further include: a standard
pattern preparation unit for receiving an output from
the frequency band converting unit and preparing a
standard pattern based on the output for carrying out
the voice recognition.

[0026] It is also preferable to further include : a stand- 
ard pattern storing unit for storing the standard pattern 
prepared by the standard pattern preparation unit. 
[0027] It is also preferable that the frequency band 
converting unit further comprises : a mercepstrum cal- 
culating unit for calculating a mercepstrum characteris- 
tic quantity based on the pseudo power spectrum pre- 
pared by the frequency band expansion unit. 
[0028] A second aspect of the present invention is a
standard pattern preparation system for preparing a
standard pattern to recognize an input voice of a narrow
frequency band. The standard pattern preparation system
includes: a frequency band converting unit for converting
the input voice of the narrow frequency band into
a pseudo voice of a wide frequency band which covers
an entirety of the narrow frequency band and which is
wider than the narrow frequency band.
[0029] It is also preferable that the standard pattern 
preparation system further includes : a power spectrum 
calculating unit for calculating power spectrums of the 
input voice of the narrow frequency band, and that the 
frequency band converting unit further comprises : a 
specific vector storing unit for storing specific vectors of 
power spectrums of a wide frequency band voice ; an 
expansion coefficient calculating unit for calculating ex- 
pansion coefficients that the power spectrums calculat- 
ed by the power spectrum calculating unit are expanded 
by the specific vectors ; and a frequency band expan- 
sion unit for calculating additional power spectrums in a 
lack frequency band by use of the expansion coeffi- 
cients calculated by the expansion coefficient calculat- 
ing unit, where the lack frequency band is covered by 
the wide frequency band but not covered by the narrow 
frequency band, and the frequency band expansion unit 
combining the additional power spectrums in the lack 
frequency band into the power spectrum of the narrow 
frequency band calculated by the power spectrum cal- 
culating unit to prepare pseudo power spectrums of the 
pseudo voice of the wide frequency band. 
[0030] It is also preferable to further include : a pattern 
reference unit for receiving an output from the frequency 
band converting unit and comparing the output with a 
standard pattern for carrying out the voice recognition. 
[0031] It is also preferable to further include: a standard
pattern storing unit for storing the standard patterns
for allowing the pattern reference unit to compare the
output with the standard pattern.
[0032] It is also preferable to further include : a stand- 
ard pattern preparation unit for receiving an output from 
the frequency band converting unit and preparing a 
standard pattern based on the output for carrying out 
the voice recognition. 

[0033] It is also preferable to further include : a stand- 
ard pattern storing unit for storing the standard pattern 
prepared by the standard pattern preparation unit. 
[0034] It is also preferable that the frequency band 
converting unit further comprises : a mercepstrum cal- 
culating unit for calculating a mercepstrum characteris- 
tic quantity based on the pseudo power spectrum pre- 
pared by the frequency band expansion unit. 
[0035] A third aspect of the present invention is a
characteristic extraction system for extracting a mercepstrum
characteristic quantity of an input voice of a narrow
frequency band. The characteristic extraction system
includes: a power spectrum calculating unit for calculating
power spectrums of the input voice of the narrow
frequency band; a frequency band converting unit
for converting the input voice of the narrow frequency
band into a pseudo voice of a wide frequency band
which covers an entirety of the narrow frequency band
and which is wider than the narrow frequency band; and
a mercepstrum calculating unit for calculating a mercepstrum
characteristic quantity based on the pseudo voice
of the wide frequency band.

[0036] It is also preferable that the frequency band
converting unit further comprises: a specific vector storing
unit for storing specific vectors of power spectrums
of a wide frequency band voice; an expansion coefficient
calculating unit for calculating expansion coefficients
with which the power spectrums calculated by the power
spectrum calculating unit are expanded by the specific
vectors; and a frequency band expansion unit for calculating
additional power spectrums in a lack frequency
band by use of the expansion coefficients calculated by
the expansion coefficient calculating unit, where the lack
frequency band is covered by the wide frequency band
but not covered by the narrow frequency band, the
frequency band expansion unit combining the additional
power spectrums in the lack frequency band into the
power spectrum of the narrow frequency band calculated
by the power spectrum calculating unit to prepare
pseudo power spectrums of the pseudo voice of the
wide frequency band.

[0037] A fourth aspect of the present invention is a 
voice recognition method for recognizing an input voice 
of a narrow frequency band. The voice recognition 
method includes the steps of: converting the input voice 
of the narrow frequency band into a pseudo voice of a 
wide frequency band which covers an entirety of the nar- 
row frequency band and which is wider than the narrow 
frequency band. 



[0038] It is also preferable to further include: calculating
power spectrums of the input voice of the narrow
frequency band; calculating expansion coefficients with which
the power spectrums are expanded by specific vectors;
and calculating additional power spectrums in a lack frequency
band by use of the expansion coefficients,
where the lack frequency band is covered by the wide
frequency band but not covered by the narrow frequency
band, so as to combine the additional power spectrums
in the lack frequency band into the power spectrum
of the narrow frequency band, thereby preparing
pseudo power spectrums of the pseudo voice of the
wide frequency band.

[0039] It is also preferable to further include: calculating
a mercepstrum characteristic quantity based on
the pseudo power spectrum.

[0040] It is also preferable to further include : compar- 
ing the mercepstrum characteristic quantity with a 
standard pattern for carrying out the voice recognition. 

[0041] A fifth aspect of the present invention is a
standard pattern preparation method for preparing a
standard pattern to recognize an input voice of a narrow
frequency band. The standard pattern preparation
method includes the steps of: converting the input voice
of the narrow frequency band into a pseudo voice of a
wide frequency band which covers an entirety of the narrow
frequency band and which is wider than the narrow
frequency band.

[0042] It is also preferable to further include: calculating
power spectrums of the input voice of the narrow
frequency band; calculating expansion coefficients with which
the power spectrums are expanded by specific vectors;
and calculating additional power spectrums in a lack frequency
band by use of the expansion coefficients,
where the lack frequency band is covered by the wide
frequency band but not covered by the narrow frequency
band, and combining the additional power spectrums
in the lack frequency band into the power spectrum of
the narrow frequency band to prepare pseudo power
spectrums of the pseudo voice of the wide frequency
band.

[0043] It is also preferable to further include : calcu- 
lating a mercepstrum characteristic quantity based on 
the pseudo power spectrum. 

[0044] It is also preferable to further include: comparing
the mercepstrum characteristic quantity with a
standard pattern for carrying out the voice recognition.
[0045] A sixth aspect of the present invention is a
characteristic extraction method for extracting a mercepstrum
characteristic quantity of an input voice of a
narrow frequency band. The characteristic extraction
method includes the steps of: calculating power spectrums
of the input voice of the narrow frequency band;
converting the input voice of the narrow frequency band
into a pseudo voice of a wide frequency band which covers
an entirety of the narrow frequency band and which
is wider than the narrow frequency band; and calculating
a mercepstrum characteristic quantity based on the
pseudo voice of the wide frequency band.
[0046] It is also preferable to further include: calculating
expansion coefficients with which the power spectrums
are expanded by specific vectors; and calculating additional
power spectrums in a lack frequency band by
use of the expansion coefficients, where the lack frequency
band is covered by the wide frequency band but
not covered by the narrow frequency band, and combining
the additional power spectrums in the lack frequency
band into the power spectrum of the narrow frequency
band to prepare pseudo power spectrums of the pseudo
voice of the wide frequency band.
[0047] In accordance with the present invention, the 
characteristic extraction unit for analyzing the input 
voice signal and extracting the characteristic quantity 
further includes a frequency band conversion unit for 
converting the narrow frequency band power spectrum 
into a wide frequency band power spectrum. 
[0048] The frequency band conversion is executed for 
preparing the standard pattern in order to reduce the 
mismatch between the standard pattern and the input 
voice. 

[0049] The characteristic extraction unit for analyzing 
the input voice signal and extracting the characteristic 
quantity further includes a frequency band conversion 
unit for converting the narrow frequency band power 
spectrum into a wide frequency band power spectrum. 
The frequency band conversion unit further includes a 
characteristic vector storing unit for storing characteris- 
tic vectors of a power spectrum of the wide frequency 
band voice. A plurality of power spectrums of the wide
frequency band voice are gathered and subjected to a
main component analysis (principal component analysis) in order
to prepare the characteristic vectors. The characteristic
vectors are then stored in the characteristic vector
storing unit. Any power spectrum of the wide frequency
band voice is represented by a linear combination of
the characteristic vectors.
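The preparation of the characteristic vectors can be sketched as follows. This is a minimal illustration under stated assumptions: the analysis is done without subtracting the mean spectrum, so that each spectrum is a pure linear combination of the vectors as stated above, and it is computed with a singular value decomposition. The function name and argument layout are hypothetical.

```python
import numpy as np

def prepare_characteristic_vectors(wide_spectra, n_vectors):
    """Derive characteristic vectors from gathered wide-band power spectrums.

    wide_spectra: array of shape (n_frames, n_bins), one power spectrum per row.
    Returns an (n_vectors, n_bins) array whose rows are orthonormal.
    """
    # Uncentered principal component analysis (an assumption): the rows of vt
    # are the principal directions, ordered by decreasing singular value.
    _, _, vt = np.linalg.svd(np.asarray(wide_spectra, dtype=float),
                             full_matrices=False)
    return vt[:n_vectors]
```

The number of vectors kept trades reconstruction accuracy against the risk of an ill-conditioned fit when only the narrow-band part of a spectrum is available.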

[0050] Short time power spectrums of a narrow frequency
band voice of the input voice signal are calculated
by a power spectrum calculating unit. The frequency
band conversion unit furthermore includes an expansion
coefficient calculation unit for calculating an expansion
coefficient, wherein the calculated power spectrum
of the narrow frequency band is expanded by the characteristic
vector of the wide frequency band stored in
the characteristic vector storing unit.
[0051] The frequency band conversion unit moreover
includes a frequency band expansion unit for calculating
a pseudo wide frequency band voice power spectrum
which is expanded by the expansion coefficient from the
input narrow frequency band power spectrum. In detail,
the frequency band expansion unit calculates a power
spectrum of a lack frequency band by utilizing the expansion
coefficient, and combines the calculated power
spectrum of the lack frequency band into the already calculated
narrow frequency band spectrum, so that the
frequency band expansion unit calculates the pseudo
wide frequency band voice power spectrum.
[0052] The voice recognition system further includes
a mercepstrum calculating unit for calculating the mercepstrum
as the mercepstrum characteristic quantity by
utilizing the above described pseudo wide frequency
band voice power spectrum.

[0053] The expansion coefficient calculation process
by the expansion coefficient calculation unit and the frequency
band expansion process by the frequency band
expansion unit may be realized by programs executed
by the computer. The programs may be stored in a computer-readable
storage medium such as a semiconductor
memory or a magnetic recording medium.
[0054] The above and other objects, features and advantages
of the present invention will be apparent from
the following descriptions.

[0055] Preferred embodiments according to the 
present invention will be described in detail with refer- 
ence to the accompanying drawings. 
[0056] FIG. 1 is a block diagram illustrative of the conventional
voice recognition system.
[0057] FIG. 2 is a block diagram illustrative of a conventional
standard pattern preparation system.
[0058] FIG. 3 is a block diagram illustrative of a first
novel voice recognition system in a first embodiment in
accordance with the present invention.
[0059] FIG. 4 is a block diagram illustrative of the
structure of the characteristic extraction unit included in
the novel voice recognition system of FIG. 3.
[0060] FIG. 5 is a block diagram illustrative of a first
novel voice recognition system in a second embodiment
in accordance with the present invention.
[0061] FIG. 6 is a block diagram illustrative of the
structure of the characteristic extraction unit included in
the novel voice recognition system of FIG. 5.

FIRST EMBODIMENT: 

[0062] A first embodiment according to the present invention
will be described in detail with reference to the
drawings. FIG. 3 is a block diagram illustrative of a first
novel voice recognition system in a first embodiment in
accordance with the present invention.
[0063] The voice recognition system includes a characteristic
extraction unit 10 and a pattern reference unit 14.
The characteristic extraction unit 10 receives an input
of a voice 1 and performs the following processes. The
pattern reference unit 14 receives the output from the
characteristic extraction unit 10 and compares the output
with a standard pattern 15 for the voice recognition.
[0064] The characteristic extraction unit 10 is provided
for analyzing the input voice signal 1 and extracting
the characteristic quantity. The characteristic extraction
unit 10 further includes a frequency band conversion
unit 13 for converting the narrow frequency band power
spectrum into a wide frequency band power spectrum.
FIG. 4 is a block diagram illustrative of the structure of
the characteristic extraction unit included in the novel
voice recognition system of FIG. 3.
[0065] The frequency band conversion unit 13 further
includes a characteristic vector storing unit 33 for storing
characteristic vectors of a power spectrum of the wide
frequency band voice. A plurality of power spectrums of
the wide frequency band voice are gathered and subjected
to a main component analysis
in order to prepare the characteristic vectors. The characteristic
vectors are then stored in the characteristic
vector storing unit 33. Any power spectrum of the wide
frequency band voice is represented by a linear combination
of the characteristic vectors.
[0066] Short time power spectrums of a narrow fre- 
quency band voice of the input voice signal are calcu- 
lated by a power spectrum calculating unit. 
[0067] The frequency band conversion unit 13 furthermore
includes an expansion coefficient calculation unit
31 for calculating an expansion coefficient, wherein the
calculated power spectrum of the narrow frequency
band is expanded by the characteristic vector of the
wide frequency band stored in the characteristic vector
storing unit 33.

[0068] The frequency band conversion unit 13 moreover
includes a frequency band expansion unit 32 for
calculating a pseudo wide frequency band voice power
spectrum which is expanded by the expansion coefficient
from the input narrow frequency band power spectrum.
In detail, the frequency band expansion unit 32
calculates a power spectrum of a lack frequency band
by utilizing the expansion coefficient, and combines the
calculated power spectrum of the lack frequency band
into the already calculated narrow frequency band spectrum,
so that the frequency band expansion unit 32 calculates
the pseudo wide frequency band voice power
spectrum.

[0069] With reference back to FIG. 3, the character- 
istic extraction unit 10 further includes a mercepstrum 
calculating unit 12 for calculating the mercepstrum as 
the mercepstrum characteristic quantity by utilizing the 
above described pseudo wide frequency band voice 
power spectrum. 

[0070] The expansion coefficient calculation process 
by the expansion coefficient calculation unit 31 and the 
frequency band expansion process by the frequency 
band expansion unit 32 may be realized by programs 
executed by the computer. The programs may be stored 
in a computer-readable storage medium such as a sem- 
iconductor memory or a magnetic recording medium. 
[0071] Operations of the above described novel voice
recognition system will be described. The input voice 1
having a restricted narrow frequency band is inputted
into the power spectrum calculating unit 11. The power
spectrum calculating unit 11 calculates the narrow band
power spectrums

$$\{N(k)\}, \quad k = K_1, \ldots, K_2 \qquad (1)$$

where N(k) is the power spectrum of the voice at a frequency
"k", and K1 and K2 correspond to the lower and upper
limits of the narrow band frequencies. In case of the telephone
voice, K1 is 300 Hz and K2 is 3400 Hz.
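The calculation of the narrow band power spectrum {N(k)} can be sketched as follows. This is an illustration only: the Hann window, the use of a discrete Fourier transform, and the function names are assumptions, and the band limits default to the telephone values given above.

```python
import numpy as np

def short_time_power_spectrum(frame, sample_rate):
    """Power spectrum of one short frame (window choice is an assumption)."""
    windowed = np.asarray(frame, dtype=float) * np.hanning(len(frame))
    power = np.abs(np.fft.rfft(windowed)) ** 2
    freqs = np.fft.rfftfreq(len(frame), d=1.0 / sample_rate)
    return freqs, power

def narrow_band(freqs, power, k1_hz=300.0, k2_hz=3400.0):
    """Keep only the bins between K1 and K2, giving {N(k)} of equation (1)."""
    mask = (freqs >= k1_hz) & (freqs <= k2_hz)
    return freqs[mask], power[mask]
```

For a frame sampled at 16 kHz, `narrow_band(*short_time_power_spectrum(frame, 16000))` returns only the bins that a telephone channel would carry.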
[0072] The frequency band conversion unit 13 converts
the narrow frequency band power spectrum into
the wide frequency band power spectrum

$$\{W(k)\}, \quad k = K_0, \ldots, K_3 \qquad (2)$$

where W(k) is the power spectrum of the voice at a frequency
"k", and K0 and K3 correspond to the lower and upper
limits of the wide band frequencies, with

$$K_0 < K_1 < K_2 < K_3 \qquad (3)$$

[0073] The mercepstrum calculating unit 12 calculates
the mercepstrum by utilizing the wide frequency
band power spectrum W(k) instead of the narrow
frequency band power spectrum N(k). The voice information
in the lack frequency bands K0-K1 and
K2-K3 is thereby utilized for improving
the accuracy of the voice recognition.

[0074] With reference to FIG. 4, the characteristic
vector storing unit 33 stores the characteristic vectors
of the power spectrum of the wide frequency band voice

$$\{\phi(i, k)\}, \quad i = 1, \ldots, M, \quad k = K_0, \ldots, K_3 \qquad (4)$$

where "M" is the number of the characteristic vectors
which have already been stored in the characteristic
vector storing unit 33. The characteristic vectors have
been prepared by gathering power spectrums of the
wide frequency band voice and applying the main component analysis.
[0075] The wide frequency band power spectrum W(k)
may be represented by a linear combination of the
characteristic vectors.

$$W(k) = \sum_{i=1}^{M} c(i)\,\phi(i,k), \quad k = K_0, \ldots, K_3 \qquad (5)$$

where c(i) is the expansion coefficient.

[0076] The expansion coefficient calculation unit 31
calculates the expansion coefficient, wherein the narrow
frequency band power spectrum {N(k)} from the power
spectrum calculating unit 11 is expanded by the characteristic
vectors of the wide frequency band power spectrum
stored in the characteristic vector storing unit 33.

$$\{N(k)\}, \quad k = K_1, \ldots, K_2 \qquad (6)$$

[0077] The expansion coefficient b(i) is calculated to
minimize the squared error "E" represented by:

$$E = \sum_{k=K_1}^{K_2} \left\{ N(k) - \sum_{i=1}^{M} b(i)\,\phi(i,k) \right\}^{2} \qquad (7)$$
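The minimization of the squared error E over the narrow band can be sketched as an ordinary least-squares fit. This is a minimal illustration assuming the characteristic vectors are held as rows of an array and that the narrow band corresponds to a known set of bin indices; the function and argument names are hypothetical, and the use of `numpy.linalg.lstsq` is one possible solver rather than a method named in this description.

```python
import numpy as np

def expansion_coefficients(narrow_power, phi, narrow_idx):
    """Least-squares estimate of b(i) from the narrow-band spectrum N(k).

    narrow_power: N(k) for the bins K1..K2, shape (len(narrow_idx),).
    phi: characteristic vectors, shape (M, n_wide_bins), rows indexed by i.
    narrow_idx: indices of the wide-band bins that lie inside K1..K2.
    """
    # Only the part of each characteristic vector inside K1..K2 enters E.
    design = phi[:, narrow_idx].T  # shape (n_narrow_bins, M)
    b, *_ = np.linalg.lstsq(design, np.asarray(narrow_power, dtype=float),
                            rcond=None)
    return b
```

The fit is well determined only when the narrow band contains at least M bins and the restricted vectors are linearly independent; otherwise `lstsq` returns the minimum-norm solution.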

[0078] The frequency band expansion unit 32 receives
the expansion coefficient b(i) from the expansion
coefficient calculation unit 31, and calculates the wide
frequency band power spectrum W(k) by use of the following
equation including the expansion coefficient b(i).

$$W(k) = \sum_{i=1}^{M} b(i)\,\phi(i,k), \quad k = K_0, \ldots, K_3 \qquad (8)$$
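The frequency band expansion step can be sketched as follows. The sketch evaluates the linear combination over the whole wide band and then, following the combining step described for the frequency band expansion unit 32, keeps the measured narrow-band values inside K1 to K2 so that only the lack frequency bands are filled in. The clipping to a non-negative floor is an added assumption (a least-squares reconstruction can produce negative values, which are not valid powers), and all names are hypothetical.

```python
import numpy as np

def expand_band(narrow_power, phi, narrow_idx, b, floor=0.0):
    """Build a pseudo wide-band power spectrum from N(k) and b(i).

    narrow_power: measured N(k) for the bins K1..K2.
    phi: characteristic vectors, shape (M, n_wide_bins).
    narrow_idx: indices of the wide-band bins that lie inside K1..K2.
    b: expansion coefficients, shape (M,).
    """
    # Linear combination of the characteristic vectors over K0..K3.
    pseudo_wide = np.asarray(b, dtype=float) @ phi
    # Inside the narrow band, keep the spectrum that was actually measured.
    pseudo_wide[narrow_idx] = narrow_power
    # Assumption: clip to a non-negative floor so later logarithms stay valid.
    return np.maximum(pseudo_wide, floor)
```

The returned array covers every bin from K0 to K3 and can be passed to the mercepstrum calculation in place of the narrow-band spectrum.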

[0079] The mercepstrum calculating unit 12 receives 
the wide frequency band power spectrum W(k) from the 
frequency band expansion unit 32, and calculates the 
mercepstrum characteristic quantity. 
[0080] In the above description, the expansion coefficient
b(i) is calculated to minimize the above described
squared error "E". Alternatively, the expansion coefficient
b(i) may optionally be calculated to maximize the
likelihood of a Hidden Markov Model for the input voice.
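For the squared-error criterion, the minimizing coefficients can be written in closed form. The following worked step is offered as a sketch of the standard least-squares argument, not as a procedure stated in this description; φ(i,k) denotes the stored characteristic vectors. Setting the derivative of E with respect to each b(j) to zero gives M linear equations:

```latex
\frac{\partial E}{\partial b(j)} = 0
\;\Longrightarrow\;
\sum_{i=1}^{M} b(i) \sum_{k=K_1}^{K_2} \phi(i,k)\,\phi(j,k)
= \sum_{k=K_1}^{K_2} N(k)\,\phi(j,k),
\qquad j = 1, \ldots, M
```

Because the sums run only over K1 to K2, vectors that are orthogonal over the full range K0 to K3 are in general not orthogonal here, so the M equations are solved together rather than one coefficient at a time.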
[0081] It is also possible that both the narrow frequency
band power spectrum and the wide frequency band
power spectrum are calculated from the voice received
in both the narrow and wide frequency bands. In the narrow
frequency band, the narrow frequency band power
spectrum is used. For the lack frequency region not covered by the
narrow frequency band, parts of the wide frequency
band power spectrum corresponding to the lack frequency
region are combined into the narrow frequency
band power spectrum, so that the combination of the
narrow frequency band power spectrum with the parts
of the wide frequency band power spectrum forms the
wide frequency band power spectrum. The wide frequency
band power spectrum is then subjected to the
main component analysis to obtain the specific vectors.
These specific vectors may be used for calculating the
wide frequency band power spectrum W(k).

SECOND EMBODIMENT: 

[0082] A second embodiment according to the 
present invention will be described in detail with refer- 
ence to the drawings. FIG. 5 is a block diagram illustra- 
tive of a first novel voice recognition system in a second 
embodiment in accordance with the present invention. 
[0083] The voice recognition system includes a learning
voice storing unit 24, a characteristic extraction unit
20, and a standard pattern preparation unit 25. The
characteristic extraction unit 20 receives an input of a
voice 1 and performs the following processes. The standard
pattern preparation unit 25 receives the output from the
characteristic extraction unit 20 and prepares a standard
pattern 26 for the voice recognition.
[0084] The characteristic extraction unit 20 further
includes a frequency band conversion unit 23 for converting
the narrow frequency band power spectrum into a
wide frequency band power spectrum. FIG. 6 is a block
diagram illustrative of the structure of the characteristic
extraction unit included in the novel voice recognition
system of FIG. 5.

[0085] The frequency band conversion unit 23 further
includes the characteristic vector storing unit 33 for storing
characteristic vectors of a power spectrum of the
wide frequency band voice. A plurality of power spectrums
of the wide frequency band voice are gathered, and a
main component analysis is carried out on these power
spectrums in order to prepare the characteristic vectors.
The characteristic vectors are then stored in the
characteristic vector storing unit 33. Any power spectrum
of the wide frequency band voice is represented by a
linear coupling of the characteristic vectors. Short-time
power spectrums of a narrow frequency band voice of
the input voice signal are calculated by a power spectrum
calculating unit.

[0086] The frequency band conversion unit 23 furthermore
includes the expansion coefficient calculation unit
31 for calculating an expansion coefficient, wherein the
calculated power spectrum of the narrow frequency
band is expanded by the characteristic vector of the
wide frequency band stored in the characteristic vector
storing unit 33.

[0087] The frequency band conversion unit 23 moreover
includes the frequency band expansion unit 32 for
calculating a pseudo wide frequency band voice power
spectrum which is expanded by the expansion coefficient
from the input narrow frequency band power spectrum.
In detail, the frequency band expansion unit 32
calculates a power spectrum of a lack frequency band
by utilizing the expansion coefficient, and combines the
calculated power spectrum of the lack frequency band
into the already calculated narrow frequency band
spectrum, so that the frequency band expansion unit 32
calculates the pseudo wide frequency band voice power
spectrum.
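The combining step can be illustrated with a short sketch: the expansion supplies values for every wide-band bin, and the measured narrow-band bins then replace the estimates inside the narrow band, so that only the lack frequency band is filled from the expansion. The array layout, the function name `pseudo_wide_spectrum` and the numbers are assumptions made for illustration.

```python
import numpy as np

def pseudo_wide_spectrum(narrow, b, basis, k1, k2):
    """Fill the lack band from the expansion and keep the measured narrow band."""
    pseudo = np.asarray(b) @ np.asarray(basis)   # estimate over all wide-band bins
    pseudo[k1:k2 + 1] = narrow                    # measured bins overwrite the estimate
    return pseudo

# Two made-up characteristic vectors over six wide-band bins.
basis = np.array([[1.0, 1.0, 1.0, 1.0, 1.0, 1.0],
                  [0.0, 1.0, 2.0, 3.0, 4.0, 5.0]])
b = np.array([1.0, 2.0])
narrow = np.array([9.0, 9.0])                     # measured values for bins 2..3
pseudo = pseudo_wide_spectrum(narrow, b, basis, 2, 3)
# -> [1.0, 3.0, 9.0, 9.0, 9.0, 11.0]
```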

[0088] With reference back to FIG. 5, the characteristic
extraction unit 20 further includes the mercepstrum
calculating unit 22 for calculating the mercepstrum as
the mercepstrum characteristic quantity by utilizing the
above described pseudo wide frequency band voice
power spectrum. The structure of the frequency band
conversion unit 23 is substantially the same as in the
first embodiment with reference to FIG. 4.
[0089] The standard pattern preparation unit 25 prepares
the standard pattern based on the wide frequency
band power spectrums of the voice to improve the
accuracy of the standard pattern.

[0090] The learning voice storing unit 24 may store
both the narrow frequency band voice and the wide
frequency band voice. If the wide frequency band voice is
stored in the learning voice storing unit 24, then a narrow
frequency part is extracted from the wide frequency
band power spectrum, and this extracted narrow
frequency part is then converted by the frequency band
conversion unit 23 into the pseudo wide frequency band
power spectrum.

[0091] Although the invention has been described 
above in connection with several preferred embodi- 
ments therefor, it will be appreciated that those embod- 
iments have been provided solely for illustrating the in- 
vention, and not in a limiting sense. Numerous modifi- 
cations and substitutions of equivalent materials and 
techniques will be readily apparent to those skilled in the 
art after reading the present application, and all such 
modifications and substitutions are expressly under- 
stood to fall within the true scope and spirit of the ap- 
pended claims. 



Claims 

1. A voice recognition system for recognizing an input
voice of a narrow frequency band, said voice rec- 
ognition system including: 

a frequency band converting unit for converting 
said input voice of said narrow frequency band 
into a pseudo voice of a wide frequency band 
which covers an entirety of said narrow fre- 
quency band and which is wider than said nar- 
row frequency band. 

2. The voice recognition system as claimed in claim 1,
characterized in that said voice recognition system
further includes: a power spectrum calculating
unit for calculating power spectrums of said input
voice of said narrow frequency band, and

characterized in that said frequency band
converting unit further comprises:

a specific vector storing unit for storing specific
vectors of power spectrums of a wide frequency
band voice;

an expansion coefficient calculating unit for
calculating expansion coefficients that said power
spectrums calculated by said power spectrum
calculating unit are expanded by said specific
vectors; and

a frequency band expansion unit for calculating
additional power spectrums in a lack frequency
band by use of said expansion coefficients
calculated by said expansion coefficient calculating
unit, where said lack frequency band is covered
by said wide frequency band but not covered
by said narrow frequency band, and said
frequency band expansion unit combining said
additional power spectrums in said lack frequency
band into said power spectrum of said
narrow frequency band calculated by said power
spectrum calculating unit to prepare pseudo
power spectrums of said pseudo voice of said
wide frequency band.



3. A standard pattern preparation system for preparing
a standard pattern to recognize an input voice
of a narrow frequency band, said standard pattern
preparation system including:

a frequency band converting unit for converting
said input voice of said narrow frequency band
into a pseudo voice of a wide frequency band
which covers an entirety of said narrow frequency
band and which is wider than said narrow
frequency band.

4. The standard pattern preparation system as
claimed in claim 3, characterized in that said
standard pattern preparation system further
includes: a power spectrum calculating unit for
calculating power spectrums of said input voice of said
narrow frequency band, and

characterized in that said frequency band
converting unit further comprises:

a specific vector storing unit for storing specific
vectors of power spectrums of a wide frequency
band voice;

an expansion coefficient calculating unit for
calculating expansion coefficients that said power
spectrums calculated by said power spectrum
calculating unit are expanded by said specific
vectors; and

a frequency band expansion unit for calculating
additional power spectrums in a lack frequency
band by use of said expansion coefficients
calculated by said expansion coefficient calculating
unit, where said lack frequency band is covered
by said wide frequency band but not covered
by said narrow frequency band, and said
frequency band expansion unit combining said
additional power spectrums in said lack frequency
band into said power spectrum of said
narrow frequency band calculated by said power
spectrum calculating unit to prepare pseudo
power spectrums of said pseudo voice of said
wide frequency band.

5. The standard pattern preparation system as
claimed in claim 2, 3 or 4 further including:

a pattern reference unit for receiving an output
from said frequency band converting unit and
comparing said output with a standard pattern
for carrying out the voice recognition.

6. The standard pattern preparation system as
claimed in claim 5, further including:

a standard pattern storing unit for storing said
standard patterns for allowing said pattern reference
unit to compare said output with said
standard pattern.

7. The standard pattern preparation system as 
claimed in claim 2, 3 or 4 further including: 

a standard pattern preparation unit for receiv- 
ing an output from said frequency band con- 
verting unit and preparing a standard pattern 
based on said output for carrying out the voice 
recognition. 

8. The standard pattern preparation system as 
claimed in claim 7, further including:

a standard pattern storing unit for storing the 
standard pattern prepared by the standard pat- 
tern preparation unit. 

9. The system as claimed in any one of claims 2 to 8,
wherein said frequency band converting unit
further comprises:

a mercepstrum calculating unit for calculating 
a mercepstrum characteristic quantity based 
on said pseudo power spectrum prepared by 
said frequency band expansion unit. 

10. A characteristic extraction system for extracting a
mercepstrum characteristic quantity of an input
voice of a narrow frequency band, said characteristic
extraction system including:

a power spectrum calculating unit for calculating
power spectrums of said input voice of said
narrow frequency band;
a frequency band converting unit for converting
said input voice of said narrow frequency band
into a pseudo voice of a wide frequency band
which covers an entirety of said narrow frequency
band and which is wider than said narrow
frequency band; and
a mercepstrum calculating unit for calculating
a mercepstrum characteristic quantity based
on said pseudo voice of said wide frequency
band.

11. The characteristic extraction system as claimed in
claim 10, characterized in that said frequency
band converting unit further comprises:

a specific vector storing unit for storing specific
vectors of power spectrums of a wide frequency
band voice;

an expansion coefficient calculating unit for
calculating expansion coefficients that said power
spectrums calculated by said power spectrum
calculating unit are expanded by said specific
vectors; and

a frequency band expansion unit for calculating
additional power spectrums in a lack frequency
band by use of said expansion coefficients
calculated by said expansion coefficient calculating
unit, where said lack frequency band is covered
by said wide frequency band but not covered
by said narrow frequency band, and said
frequency band expansion unit combining said
additional power spectrums in said lack frequency
band into said power spectrum of said
narrow frequency band calculated by said power
spectrum calculating unit to prepare pseudo
power spectrums of said pseudo voice of said
wide frequency band.

12. A voice recognition method for recognizing an input
voice of a narrow frequency band, said voice
recognition method including the steps of:

converting said input voice of said narrow frequency
band into a pseudo voice of a wide frequency
band which covers an entirety of said
narrow frequency band and which is wider than
said narrow frequency band.

13. The voice recognition method as claimed in claim
12, further comprising the steps of:

calculating power spectrums of said input
voice of said narrow frequency band;

calculating expansion coefficients that said
power spectrums are expanded by specific
vectors; and

calculating additional power spectrums in a
lack frequency band by use of said expansion
coefficients, where said lack frequency band is covered
by said wide frequency band but not covered
by said narrow frequency band, so as to combine
said additional power spectrums in said lack frequency
band into said power spectrum of said narrow
frequency band, thereby to prepare pseudo
power spectrums of said pseudo voice of said wide
frequency band.

14. The voice recognition method as claimed in claim
13, further comprising the step of:

calculating a mercepstrum characteristic quantity
based on said pseudo power spectrum.

15. The voice recognition method as claimed in claim
14, further including the steps of:

comparing said mercepstrum characteristic
quantity with a standard pattern for carrying out
the voice recognition.

16. A standard pattern preparation method for preparing
a standard pattern to recognize an input voice
of a narrow frequency band, said standard pattern
preparation method including the steps of:

converting said input voice of said narrow frequency
band into a pseudo voice of a wide frequency
band which covers an entirety of said
narrow frequency band and which is wider than
said narrow frequency band.

17. The standard pattern preparation method as
claimed in claim 16, further including the steps of:

calculating power spectrums of said input voice
of said narrow frequency band;

calculating expansion coefficients that said
power spectrums are expanded by specific
vectors; and

calculating additional power spectrums in a
lack frequency band by use of said expansion
coefficients, where said lack frequency band is
covered by said wide frequency band but not
covered by said narrow frequency band, and
combining said additional power spectrums in
said lack frequency band into said power spectrum
of said narrow frequency band to prepare
pseudo power spectrums of said pseudo voice
of said wide frequency band.

18. The standard pattern preparation method as
claimed in claim 17, further comprising the step of:

calculating a mercepstrum characteristic quantity
based on said pseudo power spectrum.

19. The standard pattern preparation method as
claimed in claim 18, further including the step of:

comparing said mercepstrum characteristic
quantity with a standard pattern for carrying out
the voice recognition.

20. A characteristic extraction method for extracting a
mercepstrum characteristic quantity of an input
voice of a narrow frequency band, said characteristic
extraction method including the steps of:

calculating power spectrums of said input voice
of said narrow frequency band;
converting said input voice of said narrow frequency
band into a pseudo voice of a wide frequency
band which covers an entirety of said
narrow frequency band and which is wider than
said narrow frequency band; and
calculating a mercepstrum characteristic quantity
based on said pseudo voice of said wide
frequency band.

21. The characteristic extraction method as claimed in
claim 20, further comprising the steps of:

calculating expansion coefficients that said
power spectrums are expanded by specific
vectors; and

calculating additional power spectrums in a
lack frequency band by use of said expansion
coefficients, where said lack frequency band is
covered by said wide frequency band but not
covered by said narrow frequency band, and
combining said additional power spectrums in
said lack frequency band into said power spectrum
of said narrow frequency band to prepare
pseudo power spectrums of said pseudo voice
of said wide frequency band.

FIG. 6